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Abstract. We introduce the notion of a locally scaling transformation defined on a compact- 
open subset of a non-archimedean local field. We show that this class encompasses the Haar 
measure-preserving transformations defined by (in particular, polynomial) maps, and 
prove a structure theorem for locally scaling transformations. We use the theory of poly- 
nomial approximation on compact-open subsets of non-archimedean local fields to demon- 
strate the existence of ergodic Markov, and mixing Markov transformations defined by such 
polynomial maps. We also give simple sufficient conditions on the Mahler expansion of a 
continuous map Zp Zp for it to define a Bernoulli transformation. 



1. Introduction 

The p-adic numbers have arisen in a natural way in the study of some dynamical systems, 
for example in the study of group automorphisms of solenoids in Lind and Schmidt [LS94]; 
other situations in dynamics where the p-adic numbers come up are surveyed in Ward [War]. 
At the same time there has been interest in studying the dynamics (topological, complex, or 
measurable) of naturally arising maps (such as polynomials) defined on the p-adics; see for 
example Benedetto [BenOl], Khrennikov and Nilson [KN04], and Rivera-Letelier [RL03]. In 
particular, Bryk and Silva in [BS05] studied the measurable dynamics of simple polynomials 
on balls and spheres on the field Qp of p-adic numbers. The maps they studied are ergodic 
but not totally ergodic and they asked whether there exist polynomials on Qp that define 
(Haar) measure-preserving transformations that are mixing. Woodcock and Smart in [WS98] 
show that the polynomial map x ^^ defines a Bernoulli, hence mixing, transformation 
on '^p. A consequence of our work is a significant extension of the result for this map, placing 
it in a greater context (see in particular Example 8.6). 

Rather than working on Qp we find that the natural setting for our work is over a non- 
archimedean local field K. We introduce a class of transformations, called locally scaling, 
and show in Lemmas 4.4 that measure-preserving (in particular, polynomial) maps are 
locally scaling. In Section 5 we apply the theory of Markov shifts to classify the dynamics 
of locally scaling transformations, decomposing the transformation into a disjoint union of 
ergodic Markov transformations and local isometrics. In particular, we show that a weakly 
mixing locally scaling transformation must be mixing. We also show the existence of poly- 
nomials defining transformations exhibiting nearly the full range of behaviors possible for 
locally scaling transformations, such as ergodic Markov, mixing Markov, and Bernoulli trans- 
formations. 

Given a polynomial defined on a compact-open subset of K, our work shows that a finite 
computation may check whether it defines a measure-preserving transformation and whether 
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it defines a mixing transformation; tlie question of crgodicity is also answered, except in tlie 
case wliere tlie polynomial is l-Lipschitz, which has been studied by Anashin in [Ana02]. 

We briefly mention related works studying measurable dynamics of certain maps on spaces 
related to the p-adics. These works [RB] , [FRL04] , and [FRL06] construct a natural invari- 
ant measure for a wide-class of rational functions, as in existing constructions in complex 
dynamics. The natural domain for these constructions is the so-called Berkovich projective 
space, a space much larger than the ordinary p-adics. 

We now indicate an outline of the rest of the paper. Section 2 reviews results on Markov 
shifts. Section 3 reviews preliminaries on non-archimedean local fields as well as some analytic 
definitions, and Section 6 recalls some of the theory of polynomial approximation on rings 
of integers of non-archimedean local fields. 

Section 4 estabhshes the fact that measure-preserving maps are locally scahng, and 
Section 5 proves our main structural results, in particular Proposition 5.5 and Theorem 5.6. 
Section 7, in particular Theorem 7.2, shows that polynomial maps arc in a sense a represen- 
tative class of locally scaling transformations, and demonstrates the existence of polynomial 
maps defining locally scaling transformation with various behaviors, including mixing. Sec- 
tion 8 and Section 10 are devoted to demonstrating two interesting classes of locally scahng 
maps on Zp that arise naturally in the study of polynomial approximations. Specifically, 
Section 8 studies maps which are isometrically conjugate to the natural realization of the 
(one-sided) Bernoulli shift, and shows for instance that the map x h- > (^^(j on is Bernoulli. 
Section 10 then studies similar binomial-coefficient maps which are locally scahng and so 
have very regular structures but fail to be Haar measure-preserving. 
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the 2005 SMALL summer research project at Williams College. Support for the project was 
provided by National Science Foundation REU Grant DMS - 0353634 and the Bronfman 
Science Center of Williams College. The authors would like to thank several anonymous 
referees for careful readings of the paper and valuable suggestions. 



Let if be a finite non-empty set. By a stochastic matrix on H we mean a map A : 
R>o such that 



Putting H into a bijection with the set {0, . . . , — 1} we may regard A as a x 
matrix with non-negative entries and the entries in each row summing to 1. In analogy 
with this case, we will refer to the sets {A{i, ■)} and {A{-,j)} as rows and columns of A, 
respectively. 

By a row vector on H we mean a map w : H ^M.. For w a row vector and A a stochastic 
matrix, we define their product as the row vector wA defined by 



2. Markov shifts 





We will say that w is non-negative (resp. positive) if it takes values in ]R>o (resp M>o)- 
To any stochastic matrix A we may associate the following symbolic dynamical system: 
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(i) Let 

Xa^ {x eY[H : ^(7r„(x), 7r„+i(x)) ^ for all n > 0} 

where 7r„ : Y[i>o H ^ H is projection to the n*'' coordinate. Give each finite factor 
the discrete topology, and Xa the subspace topology inherited from the product 

topolog 

(ii) Let Ta : Xa Xa be defined by 7r„ o T4 = Hn+i, that is, Ta is simply "shifting 
left." Then, {Xa,Ta) is a topological dynamical system. 

(iii) If in addition we are given a non- negative row vector w, then we may define a 
measure on Xa by 

e 

liA,w{[dodid2 . . . di]) ^ w{do)A{do,di) ■ ■ ■ A{di-idi), where [do...di] = Q 7r^^(ci„). 

n=0 

We call a set of the form [do . . . di] a cylinder set; we may observe that the cylinder 
sets form a base for the topology on Xa- Note that if w is in fact positive, then iia,w 
assigns positive measure to each cylinder set and hence to each open set. We may 
check that if -u; = wA then T4 is measure-preserving with respect to ^a,w 

We call such a dynamical system a Markov shift. We say that a dynamical system is 
Markov if it is isomorphic to some Markov shift. 

We say that a stochastic matrix A is irreducible or ergodic if for each i,j e H there exists 
a n e N such that A^[i,j) > 0. This condition has a natural interpretation in terms of the 
connectedness of a certain directed graph associated with A, as we shall see in the proof of 
Proposition 2.1. We say that a stochastic matrix A is primitive if there exists a n e N such 
that ^"(i, i) > for all i,j e H. 

Using the Perron-Probenius Theorem on non-negative irreducible and primitive matrices, 
along with a graph theoretic interpretation of the stochastic matrix, one may obtain an 
ergodic decomposition result for Markov shifts: 

Proposition 2.1. Let A be a stochastic matrix, and w a positive row vector such that 
w = wA. 

Then, we may partition H into disjoint sets 

n 
k=l 

such that 

(i) A{i,j) = for i G i^fc^J ^ with k 7^ i; and 
(n) Ak^ A\ 

HuxHk '^^ irreducible for A; — 1, . . . , n 
Then, — satisfies — WkA^. And we have the ergodic decomposition of{XA, IJ'A,w, Ta) 
as 

n 

iXA,fJ'A,w,TA) = \_\{Xa,^, HAk,Wk,TAk), 

k=l 

where Xa^. is viewed as a TA-invariant subset of Xa, so that lJ>Ak,wk — I-I'A,w\a^. ^'^^ Ta^. — 
Ta\a,- 

Moreover, the /c*'' summand is mixing if A^ is primitive. 
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Proof. Construct a graph on H as follows. We place a directed edge from i — j if and only if 
A{i,j) > 0. As w is strictly positive, this is equivalent to the condition that w{i)A{i,j) > 0. 
We say that the flow or flux associated to this edge is w{i)A{i,j). Now, the flow out of i is 

^w{i)A{i,j) = w{i) 

as A is a, stochastic matrix. The flow into i is 

^w{k)A{k,i) = w{i) 

keH 

as w — wA. So, we see that the flux into and out of i are both equal to w{i). 

This implies that for every finite subset of H, the in-flux and out-flux will be equal. For 
i E H, let R{i) be the set of points reachable from i, and B(i) the set of points which can 
reach i. Note that R{i) has out-flux by construction, and B{i) has in-flux by construction; 
as if is flnite, these subsets are flnite, so both have in-flux and out-flux equal to 0. 

Now, we can have no edges into or out of either of these two sets. But, if t G R{i) and 
y G B{i), then there is a path from y to t; so we must have t G B{i) and y G R{i), and so 
B{i) = R{i). So, B{i) = R(i) is strongly connected, and there are no edges into or out of 
this set. 

For £ > 0, note that A^{i,j) > is equivalent to there being a path of length precisely i 
from i to j. It follows that the collection 

{B(i) -.ieH} 

gives our desired decomposition of H. 

We readily note that Wk = w^Ak for for k = 1, . . . ,n. Then, as Aj^ is irreducible, [Wal82, 
Theorem 1.19] implies that the k^^ summand is ergodic, from which the ergodic decomposi- 
tion follows. Finally, [Wal82, Theorem 1.31] implies that the k^^ summand is mixing if A^ 
is primitive. □ 

3. Analytic definitions, preliminaries, and notation 

Let K he a non-archimedean local field, which we take to be either a flnite fleld extension 
of Qp or ¥pn[(t)) for some prime p. 

Let I • I be a non-archimedean multiplicative valuation (sometimes called a "non-archimedean 
absolute value") on such that | ■ | generates the topology on K. Denote V = | = {|a;| : 
X G K^}, O = {x E K : \x\ < 1} and p = {x E K : \x\ < 1} (note that (9, p are independent 
of the choice of valuation). It is the case that (9 is a ring with maximal idea p and that O /p 
is a flnite fleld (the residue field). Let p = charC/p, q = #0/p, both flnite with q a power 
of p. 

We denote 

Br{x) = {y E K : \x — y\ < r}, 

and call such a set (for any value of r) a ball. A ball of radius precisely r will be called an 
r-ball. Let // be Haar measure on K, normalized such that n{0) = 1; deflne p : V ^ M>o by 
p(r)=/x(S,(0)). 

Now, we recall the following standard results: 

(i) O is a discrete valuation ring with unique maximal ideal p; 

(ii) p = ttO for any tt G p \ p^; we call any such tt a uniformizing parameter; 
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(iii) V is the discrete abelian (multiplicative) subgroup of Q generated by |7r|; in light of 
this, we may define a map v : K ^ ZU {+00} defined by v{0) = +00 and v{x) ~ 
log|^l |a;| for x e K^; this is the additive valuation (sometimes just "valuation") on 

(iv) For r = IttI*^, A; > it is the case that 

p{r) = f,{Brm = a^O/p")-' = q-K 

Indeed, for r G V we see that p(r) = 

(v) A subset X ^ K is compact-open if and only if X is a finite union of balls. 

We direct the interested reader to [Ser62] for a thorough treatment of related topics. 

We will continue to use the symbols K, ij,,p,q,\ • |, O, p, tt, v, V, p, Br with these meanings 
below. 

Let X be an open subset of K and a E X. Then, we say that a function f : X ^ K is 
strictly differentiable or at a (denoted / G C^{a)) if the limit 

(x,y)^(a,a) X — V 

x^y ^ 

exists. We write / G C^{X) if / G C^(a) for each a E X. For more on this notion, see [Sch84] 
or [RobOO]. 



4. Measure preserving maps on non-archimedean local fields 

Definition 4.1. For X ^ K compact-open, we say that a transformation T : X ^ X 
is locally scaling for r G V if X is a finite union of r-balls and if there exists a function 
C : X ^ M>i such that 

|x — < r =r- \T{x) — T{y)\ = C{x)\x — y\ for x,y E X. 

We will refer to C as the scaling function. 

Remcirk 4.2. Let us make the following observations about locally scaling transormations: 

(i) By the symmetry of x and y in the previous displayed equation, C is constant on 
cosets of Br{0). We will write H = X/Br{0) for the set of cosets of -8^(0) contained 
in X (recall that X is a union of such cosets); we treat elements of H as subsets of 
X. Then, C induces a map C : H ^ K>i. 

(ii) The terminology "locally scaling" is convenient but perhaps slightly misleading: For 
us, such transformations must not only locally scale distances, but must do so by a 
factor that is at least 1. 

This definition is motivated by the ease of analyzing the structure of such maps together 
with the following easy lemma: 

Lemma 4.3. Let X <Z K he open, and suppose f G C^{X) is such that f'{x) ^ for 

X E X . Then, \f'{x)\ is locally constant on X. If, moreover, X is compact, f{X) C X, and 
I /'(a) I > I or all a E X, then the induced transformation f : X ^ X is locally scaling for 
some r G V. 
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Proof. Fix a e X. Since X is open and / e C^(a), there exists ra e V so that Br^{a) C X 
and 

/(^) - f{y) 



x-y 



/'(«) 



< |/'(a)| for x,y e BrSa), X ^y. 



By the strong triangle inequahty, it follows that \f{x) — f{y)\ = \f'{a)\\x — y\ for all x,y G 
Br{a). In particular, we note that \f'{x)\ is constant on Br^{a). 

If X is compact, then there is a finite set Qq, . . . ,ak & X so that Br^^ (oq), . . . , Br^^ (a^) is 
an open cover of X. Taking r < min^Lgra^., we see that X is a union of r-balls and that / 
restricted to each Br{a) scales distance by l/X^f)! ^ 1 by the previous displayed equation. 
So, / is locally scaling for r G V with scaling function C{x) = □ 

Lemma 4.4. Let X C K be open. Let f E C^iX) be such that f{X) C X and such that the 
transformation f : X ^ X is measure-preserving with respect to resiiX. Then, |/'(a)| > 1 
for all a E X . Furthermore, f is locally scaling for some r e V. 

Proof. Suppose there is an a e X with |/'(a)| < 1. Take a E V such that |/'(a)| < a < 1. 
As X is open, there exists r' E V such that Br'{a) C X and Br\f{a)) C X. Moreover, as 
/ E C^{a) we may take r e V, with r < r', such that 

\f{x) - f{y) - f'{a){x -y)\<a\x- y\ for any x,y E Br{a). 

It follows, by the strong triangle inequality, that \f{x) — f{y)\ < \x — y\a for x,y E Br{a). 
So, Bar{f{a)) ^ by construction and moreover /^^ {Bar{f{aj)) ^ Br{a). Taking measures 
we note that 

II {B^rifia))) = piar) < p(r) = p (5,(a)) < p (/"^ (i^a.(/(a)))) . 

So, / is not measure-preserving. 

This proves that |/'(a) > 1 for all a E X. The remaining part of the claim follows from 
Lemma 4.3. □ 

Remark 4.5. Note that for / E K[x], f E C^{K). So, if f{X) C X induces a measure- 
preserving transformation f : X ^ X, then Lemma 4.4 and Lemma 4.3 imply that / is 
locally scaling. 

Example 4.6. Consider the map / : Z2 — Z2 defined by 

^x\ x{x — 1) 



Then, 



{x - y){x + y - 1) 



2\x — y\\x + y — 1\ 

So, / is locally scaling for r = 1/2, since \x + y — 1\ = 1 when \x — y\ < 1/2. We will see in 
Section 8 that / : Z2 ^ Z2 is actually measure-preserving and in fact Bernoulli. 

Remark 4.7. Suppose / E K[x] defines a transformation f : ^ Zp. Since / is polyno- 
mial, we have / E C^i^^ and thus, by Lemma 4.3, is locally scaling for some r e V. We can 
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use the Taylor expansion of / to find sucli an r (tliis idea is similar to that in [KN04, p. 33, 
Lemma 1.6]). Specifically, writing 



f{x + z)-f{x) = zf\x) + Y,-f 

k=2 



we note by the strong triangle inequality that it suffices to choose r so that 



X 



k\ 



r'-' < \f'{x) 



for all X e Zp and n>2. 



5. Structure of locally scaling transformations 

Lemma 5.1. Let X <Z K be compact-open, with T : X ^ X locally scaling for r e V. Let 
C : X — > R>i be the scaling function of Definition 4-1- Then, for each a & X and r' e V 
with r' < r, the map 

is a bijection. 

Proof. Denote B = Bjj{a) and B' = Br'c{a){T{a)). As C is constant on Br{a), \Tx — Ty\ = 
C{x)\x — y\ = C{a)\x — y\ for a\l x,y E B C Br{a). This implies that T{B) C B', so our 
restriction is well-defined. It also implies that the restriction is injective. 

For each A; > we may take coset representatives qq, ■ ■ ■ , Ogfc-i for B/ B^i^^^^k^O). Then for 
i,j e {0, . . . , — 1} we have 

\T(ai) - T(aj) \ = C(a)\ai - %| > r'C(a)\7r\\ 

So, T{ao), . . . ,T{aqk_i) are precisely the q'^ coset representatives for B' / Br'c{a)\TT\''{^)- It 
follows that T{B) is dense in B'. 

Now, note that T\^ is continuous. So, T{B) is the continuous image of a compact set, 
thus compact, and so closed. So, T{B) = B'. This proves surjectivity, and the lemma is 
proved. □ 

Corollary 5.2. Let X,T,C be as in Lemma 5.1. Set H = X/Br{0). For any i,j G H, 
a e j, and r' e V with r' < r, the set 

ir\T~^ {Br' {a)) 

is either the empty set or a ball of radius r'/C{i), according as whether i fl T~^{j) is empty 
or not. 

Proof. Denote B — Br'{a). Assume i fl T~^{j) is not empty, so there is a y e i fl T''^{j). 
Then, the map 

T^iy) ■ i = Br{y) ^ BrCii){T{y)) DjDB 
is a bijection. It follows that i fl T'~^{B) is non-empty, and we may in fact assume that 

y eir\T-\B). 

Then, as 

T\B^,,,,^,(y)--Br'/ciM^BAT{y))^B 

is also a bijection, it follows that i fl T~^{B) = Br'/c(i){y)- D 
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Definition 5.3. Let X C K he compact-open, and let T : X — X be locally scaling for 
r e V. Let H = X/Br{0) and C : H M>i be the scaling function. Then, we define the 
associated transition matrix to be the map A : M>o given by, for i, j e 



inT-i(j) = 

p(l/C(i)) otherwise 



Lemma 5.4. Let X C K be compact-open and T : X ^ X be locally scaling for r e V; let 
H — X/Br{0) and let A : M>o be the associated transition matrix. Then: 

(i) For S Q X measurable and i & H 

i,(inT-\s)) = J2^^(snj)A(i,j). 

(ii) A(i,i) = ^M^nT-i(j)); 

(iii) A is a stochastic matrix on H; 

(iv) T is measure-preserving if and only if the sum of each column of A is 1. 

Proof. 
(i): 

By disjoint additivity of /x, it suffices to prove the equality in the case S Q j for some j e H. 
As the balls form a sufficient semi-ring in the Borel cr-algebra oi X, we may in addition 
assume that S is a. ball. Say S — Br' (a) for r' < r and a & j. Then, by Corollary 5.2 we 
know that ir\T~^{S) is either the empty set or a ball of radius r' /C{i), according as whether 
i n T~^{j) is empty or not. Taking measures we get 



i,{tnT-\s)) 



inr-^(i) = 

p{r'/C{i)) otherwise 
^l{S)A{^,3) = Y,^^{Sn3)A{l,3). 



(ii): 

Put 5" = j in (i) . Then we get 



|l{lf^T-\3)) = ii{3 f^3)A{l,3) = p{r)A{i,3). 

(iii) : 

Note that for each i e if, by disjoint additivity of along with (ii) we have 

p(r) 3) = 5^ /^(i n T-\3)) = i^{t nX) = ^{i) = p(r). 

j^H jeH 

(iv) : 

If T is measure-preserving then for each j e if we have, by disjoint additivity of /i, 

= 4tE/^(^^^"'0')) = -i-MxnT-\3)) = -^MT-\3)) = 1- 

PKn ^ PKn p[r) 



For the converse we use (i) and disjoint additivity: 

ieH iJeH jeH 

Proposition 5.5. Let X ^ K be compact-open, let T : X ^ X be a locally scaling trans- 
formation for r E V, with S = (X, fi, T) the corresponding measurable dynamical system. 

Let H = X/Br{0), let A : — > R>o be the associated transition matrix, and w : H ^ M>o 
the positive row vector given by w{i) = p{r) for i & H. Let E' — {Xa_, (j^a,w,Ta) be the 
corresponding Markov shift. 

Then, there exists a continuous, measure-preserving surjection $ : X — > Xa satisfying 
$ o T = T4 o $. Moreover, the pre-image under ^ of a cylinder set is a ball of the same 
measure. 

Proof. For each n > 0, let 7r„ : Xa H denote projection to the n*'' coordinate. Let 
(p : X ^ H he the canonical projection. Consider the map $ : X — > Xa defined by 

TT, O $ = O T", 

i.e., the n**^ slot in Xa denotes which element of H the point T^{x) is in. Then $oT = T^o$ 
by construction. 

Let (io,(ii, . . . e H. We will prove by induction on the number of slots specified (the 
"length" of the cylinder set [d^.-.d^) the claim that the pre-image of the cylinder set 
[dQ . . .d(\ is a ball of the same measure as the cylinder set. Note that $~^([(io]) — do is 
a ball of the correct measure as 

/^A,«;(Mo]) = w{do) = p(r) = ii{do) 

by construction. Now, 

^-\[do . . . del) = ^-\[do]) n {[d^ . . . di\) . 

By the inductive hypothesis, this is the intersection of two balls, and is thus again a ball; so 
$ is continuous. 

Noting that ^^^{[di . . .di]) C = di and applying claim (i) of Lemma 5.4, along 

with the inductive hypothesis, we see that this ball has the correct measure 

// {^-\[do . . . de])) = 5] (j n ^-\[d, . . . de])) A{do,j) 

^A{do,di)fi {<^'\[d,...d,])) 

= A{dQ, di)n{[di . . . de]) = n{[do . . . di]). 

As the cylinder sets are a sufficient semi-ring in the Borel cr-algebra of Xa, this shows that 
$ is measure-preserving. 

Note that $ continuous and measure-preserving implies $ surjective: X is compact and 
Xa is Hausdorff, so the image must be closed. However, the image must have full measure 
and so must be dense {w positive implies that all cylinder sets, hence all open sets, have 
strictly positive measure). □ 

Theorem 5.6. Let X, T, i/, S, S', $ be as in Proposition 5.5. Moreover, assume that E is 
measure-preserving, so that E' is as well. 
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Now, let H — Llfe=i -^ik be a decomposition of H in the sense of Proposition 2.1, so that 

n 



fc=l 

where S'^ = (X^^, /x^^,^, T4J. 
For k — 1, . . . ,n define 

~ I restriction ofYl to 4^Xa^ < 00 

^ I otherwise 

then we have an isomorphism of topological and measurable dynamical systems 

n 
k=l 

Moreover, each term in this decomposition is either locally an isometry or ergodic Markov, 
according as whether ^^Xa^. < 00 or not. 

Proof. By Lemma 5.4, E' is a quotient of E, so E measure-preseving implies E' measure- 
preserving. 

For k = 1, . . . ,n, denote = $~^(X^^), //^ = A*lcfc' ^fe — -^Icfc- decomposition of E' 
induces the following decomposition of E: 



k=l 



To complete the proof of the proposition, it suffices to show that {C^, Hk,Tk) = E^ for 

k — l,...,n as topological and measurable dynamical systems, and to classify them as 

being locally isometrics and ergodic Markov in the two cases. We now handle the two cases 

separately: 

Case 1: ^^Xa,, < 00 

If ^^Xai^ < 00, then the isomorphism {Ck, Hk,Tk) = E^ follows by definition. Note that the 
measure on E'^ is necessarily atomic; as it is ergodic, it must in fact be the inverse orbit of 
a single atom. As Ta, hence T^^,, is measure-preserving, each of the atoms must have equal 
measure. It follows that each element x e Xa^. is of the form 

X = {do, di, . . . , de, do,..., de, do, . . . ,di, . . 

with A{dQ,di) = A{di,d2) = ... = rfo) = 1- Then, = do, where C(do) = 1 

(here, C is that from the definition of locally scaling). 

So, Ck must be a collection of r-balls with C{x) = 1 for x G Ck- Then, for x,y & Ck with 
|a; — |/| < r we have \T{x) — T{y)\ = C{x)\x — y\ = \x — y\. This shows that E^ is locally an 
isometry, as desired. 
Case 2: ^Xa^ = 00 

If #Xaj, = 00, then we claim that $ induces an isomorphism {Ck, iJ>k,Tk) — E^. In a 
measure-preserving Markov shift, any atoms must have finite inverse orbit; so E'^ ergodic 
and H^Xa^. = 00 implies that lJ>Ak,wk non-atomic. Recall that $ is surjective. We claim 
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that it is also injective. For x & X let dn — 'KhT'^{,x) for n = 0, 1, Then, 

^-\^{x))^[\^-^{[d,...d,\). 

Wc have from Proposition 5.5 that each of these pre-images is a ball Then, $-^(<l>(a;)) is the 
intersection of a nested family of balls. If the intersection contains more than a single point, 
then the radii of the balls do not go to 0, and so the intersection has non-empty interior and 
thus positive measure. Now, the measure on X^^, is non-atomic, so iiAk,wk{^{^)) — ^- As $ 
is measure-preserving, this implies that fi[^~^(^(x))) = 0; by the above considerations this 
implies that $^^($(a;)) contains at most one point. So, $ is injective. 

Then, $ is a continuous, measure-preserving bijection. Observe that $ takes closed sets 
to closed sets by compactness, so its inverse is also continuous. This also implies that 
is measurable, and then $ measure-preserving implies measure-preserving. So, $ is an 
isomorphism of topological and measurable dynamic systems {Ck, jJ-k^Tk) = Sjt as desired. 
As the later is ergodic Markov, the former is as well. □ 

Corollary 5.7. Let X <Z K he compact-open and T : X ^ X a measure-preserving locally 
scaling transformation. If T is ergodic then it is either Markov or locally an isometry. 
In particular, if it is weakly mixing then it also Markov and so mixing. So, for a measure- 
preserving locally scaling transformation on a compact-open X, weakly mixing implies mixing. 

Proof. If T is ergodic, then the decomposition in Theorem 5.6 must be trivial. So, T must 
be either Markov or locally an isometry. If it is locally an isometry, then it cannot be weakly 
mixing. So, weakly mixing implies weakly mixing Markov which in turn implies mixing. □ 

CoroIIciry 5.8. For a locally scaling transformation, the following properties depend only on 
the associated transition matrix: 

(i) Measure-preserving; 

(ii) Weakly mixing, mixing, exact, Bernoulli. 

Proof. By Lemma 5.4, the property of being measure-preserving depends only on the asso- 
ciated transition matrix. 

Note that the decomposition in Theorem 5.6 depends only on the associated transition 
matrix. Given an associated transition matrix, we have the following cases: 

(i) The decomposition is trivial, and the system is a local isometry. Then, it is not 
weakly mixing (or any of the stronger properties listed). 

(ii) The decomposition is trivial, and the system is ergodic Markov. In this case, the 
system is determined up to isomorphism by the matrix. 

(iii) The decomposition is not trivial. In this case, the system is not ergodic and cannot 
satisfy any of the stronger properties hsted. □ 

6. Polynomial approximation in O 

The above results dealt with functions, extending to polynomial maps as a special 
case. In the next sections we will be interested in finding polynomial maps with specified 
associated transition matrices. In preparation for this, we will need some results on the 
approximation of continuous maps O ^ K. For the reader's convenience, we will sketch 
here the definitions and results of [Ami64], slightly simplified for our applications. 
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Say X O O is compact-open. Moreover, assume that X is a finite union of r-balls for 
r G V. Then, for r' < r each r'-ball contained in X is a union of precisely q balls of radius 
|7r|r' contained in X. In the terminology of [Ami64], this makes X a regular valued compact 
{compact value regulier in the original French). 

For k > log|^| r, we may define Hk = X/ B^^^k{0), and a projection map vr^ : X — Hk- 
Then, we say that a sequence {uk G X : A; G N} is very well distributed {tres bien repartie) 
if for each k > ^og^^^ r, h E Hk, and m > 1 we have 

#{i < m^Hk : Ui G h} = m. 

That is, the terms of the sequence must be equally distributed among the possible values 
mod p'' for k > log|^| r. Note that the condition that the {uk} are very well distributed 
implies that they are distinct. 

Now, given such a sequence {1^0,1*1, ■ ■ ■}, we may define the corresponding interpolating 
polynomials for /c > 0: 

pJx) 

Pk{x) = {x - uo){x - ui) ■ ■ ■ {x - Uk-i) and Qk{x) = — — -. 

Then, we may summarize some of the results of [Ami64, §11.6.2] as follows: 

Theorem 6.1 (Amice). Let X <Z O be compact-open, and let {uk} be a very well distributed 
sequence with values in X with P^, Qk the corresponding interpolating polynomials. Let 
f : X ^ K be continuous, and for k >0 set 

Then: 

(i) \ak\ as k ^ 00; 

(ii) Y,k>o(^kQk{x) f{x) uniformly on X ; 

(iii) The ak are determined by (ii); 

(iv) sup^.g;^ \f{x)\= supkeN Wk\- 

A very well distributed sequence {uk} is said to be well ordered {bien ordonnee) if |m„ — 
Um\ = |7r|^''*^"~"*^ for all n, m > where Vq{n — m) is the exact power of q dividing n — m G Z. 
Following our sources, we will call such a sequence T.B.R.B.O. {tres bien repartie bien 
ordonnee) . This allows us to state results of Helsmoortel and Barsky, characterizing Lipschitz 
and functions on O in terms of the coefficients in their expansions. This result may be 
found in [Bar 73]. 

Theorem 6.2 (Helsmoortel, Barsky). Let {ufc} be a T.B.R.B.O. sequence with values in O, 
with Pk, Qk the corresponding interpolating polynomials. Let f : O ^ K be continuous with 

/(^) = ^(^kQk{x) 

k>0 

the expansion of f in the sense of Theorem 6.1. For k >1, define 
Then: 
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(i) / is r-Lipschitz if and only if r < Hk\(ik\ for all k > 1; 

(ii) / e C^{0) if and only if Kk\ak\ as k ^ oo. 

Example 6.3. Note that {0, 1,2, . . .} C Zp satisfies the conditions for being a very well 
distributed sequence, and is in fact trivially T.B.R.B.O. Then, 

^ , , x(x — 1) ■ ■ ■ (x — k + 1) fx 
Qk{x) = - ' 



k-{k-l)----l \k 

So, in this case the above reduces to the Mahler expansion. 

More generally: Let ao, • • • .Og-i be a complete set of coset representatives for O/p. For 
A; e N, we will define Uk in terms of the base-g expansion of k: 

e e 

hq' I — ^ ^ ttkX = Uk- 

1=0 1=0 

Then, say we have n,m E N with n = ^j>o^i5* and m = Ylii>o^i^^ ■ ^ ~ '^^(^ ~ ~ 
min{i : rrii}. Then, 

l^n I 1^1 1^1 \ 

and {uk} is T.B.R.B.O. In particular, this implies that there is always a T.B.R.B.O. sequence 
for O, and corresponding interpolating polynomials such that the results cited in this section 
hold. 

Now, we establish a lemma that will be of particular interest to us: 

Lemma 6.4. Let {«„} he a T.B.R.B.O. sequence in O with corresponding interpolating 
polynomials Pk,Qk Then, for k e M; 

(i) Qk{0) C O; 

(ii) Qk is Kk-Lipschitz, with Hk o,s in Theorem 6.2; 

(iii) If k — q^ for some > 0, then 

\Qk{x) - Qk{y) \ = i^k\x - y\ for all x,y eO with \x - y\ < l/nk- 

Proof. Claim (i) follows by applying Theorem 6.1 (iv) with f = Qk (so that Qi — 1 for i — k 
and otherwise). Claim (ii) follows similarly from Theorem 6.2. 

Assume k = q^. Fix x,y E O with |x — 7/| < l/Kk, as the {«„} are very well distributed 
there is some m € {0, . . . , A; — 1} such that \x — Um\ i^k (hence also \y — Um\ < ^/ i^k)- 

Define a polynomial 

S iz)- V (l/ - Mq) • • • (l/ - Mil) • ■ • (y - Uj^) ■■■{y- Uk-l) 

^ n^^"— ^^i. {Uk - Uk-l) ■ ■ ■ {Uk - Uq) 

Then, we may observe that 

[Uk - Uk-i) ■■■{Uk- Uo) ^ 
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for any x',y' e O. Then, 

Qk{x) - Qk{y) = {Qk{x) - Qk{um)) - {Qk{y) - Qk{um)) 

k 

= (X - y)Si{Um) + ^Sj{Um) {{X - UmY - {V - UmY) ■ 

We will prove the following two statements, which together with the strong triangle in- 
equality and the previous expression imply our desired result: 

(i) \Si{um)\ = Kit; 

(ii) \Sj{urn)\ < kI for 1< j < k. 

That this suffices is clear, for the j — 1 term will dominate in valuation. 
Observe that 

{y-uo)---iy-Um)---iy- u^-i) 



and that 



Si{um) 



E 

0<ii<i2<"-<ij<k 



[Ur. 



{Uk - Mo) • • • (Uk - Mfc-l) 



[Ur 



Ui 



Ui 



[Ur 



Uk-l) 



{uk - Uk-l) ■■■{uk- Uo) 



Suppose {vn} is a very well distributed sequence. Then, it is easy to check that {vq, . . . , Wfc-i} 
must contain precisely elements bounded by tt^ for each i' < i. Now, observe that both 
{um — Un n & N} and {uk — Un '■ n E N} are very well distributed. Let m' be the unique 
index in {0, . . . , A; — 1} such that \uk — Um.'\ < l//tfc; the very well distributed property of 
{un} implies that this is in fact an equality. Our previous count implies that we must have 



{y -Uq)- ■ -{y -Um)- ■ -{y - Uk-l) 



So, 



{Uk -Uq)--- {Uk - Um') ■ ■ • {Uk - Uk-l) 

1 



1. 



Now, 



Sj{Um) 



Si{Um) 



\Sl{Um)\ = 



E 



Uk - Um' 



Kk- 



m^{il,...,ij^} 



^^j. {Um ^ii) ■ ■ ■ {Um "^ij-i) 



k ' 



for \um — MjJ, . . . , \um — Mj^^J > 1 / Kk as is very well distributed (and so the first k 

elements must be in disjoint l/K^-balls) . This completes our proof. □ 



7. Polynomial maps on O realizing locally scaling transformations 

Sections 4 and 5 characterize measure-preserving polynomial transformations on O in 
terms of locally scaling transformations. However, wc have shown the existence of only a 
handful of such maps. In this section, we will show that in fact the polynomials, in a sense, 
provide a representative class among the measure-preserving locally scaling maps. 
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We begin with a lemma giving sufficient conditions for two maps to fiave tfie same asso- 
ciated transition matrices. 

Lemma 7.1. Suppose T : O ^ O is locally scaling for r G V; set H = O/Br{0), let 
C : H R>i be the scaling function for T , and let A : — > R>o he the associated 
transition matrix for T . Suppose in addition that S : O O is a transformation such that 
the difference R — T — S satisfies 

(i) \R{x) — R{y)\ < C{x)\x — y\ whenever < \x — y\ < r; 

(ii) \R{x)\ < rC{x) for allx. 

Then, S is locally scaling for r & V , with scaling function C and associated transition matrix 
A. 

Proof. For < |x — y| < r we fiave 

\S{x) - S{y)\ = \T{x) - T{y) + R{x) - R{y)\ = \T{x) - T{y)\ = C{x)\x - y\ 

by tlie strong triangle inequality. Indeed, \T{x) —T{y)\ = C{x)\x — y\, which by (i) is strictly 
greater than \R{x) — R{y)\. So, S is locally scahng for r e V, with scaling function C. 

Now, it remains to verify that i fl T~^{j) = i fl S~^{j) = for i,j G H. For this, it 
suffices to show that T{Bj.{x)) — S{Br{x)) for all x & O. Indeed, applying Lemma 5.1 and 
(ii) yields 

T{Br{x)) = Brci.){T{x)) = BrC(.){S{x)) = S{Br{x)). □ 

For S C K we say that T : S K is affine if it is given hj x ax-\-b for some constants 
a,b E K. Wc say that T : O K is locally affine if for each x & O there exists a r G V 
such that T\^ ^-^.^ is afhnc. 

Theorem 7.2. Let r G V and H — O/Br{0). Let A be a stochastic matrix on H. Then, let 
Ta — {T locally scaling for r : A is the associated transition matrix for T}. 

IfT^is non-empty then: 

(i) Ta contains a locally affine transformation; 

(ii) Ta contains infinitely many polynomials. 

Proof. 
(i): 

Suppose T eTa- Let C : H ^ R>i be the scaling function in the definition of locally scaling 
and observe that its image is contained in V. Let S : H ^ K he any function satisfying 
\S{h)\ = C{h) for all he H. Then, for each h e H we observe that T{h) = {T{x) : x e h} 
and S{h)h — {S{h)x : x e K\ are both balls of radius C{h)r. So, there exists a function 
(indeed, many of functions) M : H K such that T{h) = S{h)h + M{h). Regarding S and 
M as functions with domain O via the quotient map O ^ H, we may define a locally affine 
transformation Ts,m : C — >■ C by the formula Ts,m{x) = S{x)x + M{x). 

Since -S", M are constant on elements of H and \S{h) \ = C{h), it follows at once that Ts,m 
is locally scaling for r with scaling function C . By construction, Ts^uih) = T{h) (which 
is also why Ts^mO C 0), so that Ts^m G X4 by the definition of the associated transition 
matrix. 
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(ii): 

Let T E Ta- By (i), we may assume that T is locally affine and hence strictly differentiable. 
Let {uk} be a T.B.R.B.O. sequence in O (which must exist by Example 6.3). Let 

be the decomposition of T in the sense of Theorem 6.1 

Take a eV with a < 1. By Theorem 6.2, HkWk] — > as A; — > oo, so there exists an e N 
such that ioT k > N we have HkWk] < a < 1. Moreover, take N such that N > ^H. 

Let 

N 

/(^) ^^(^kQk{x). 

k=0 

Note that / is a polynomial. Set 

R{x) = T{x) - f{x) = ^ akQk{x). 

k>N 

Our bound on «;fe|afe| along with the choice N > = q^°^\-^\^ implies that \ak\ < I/acat < r 
for k > N; so Theorem 6.1(iv) imphes that |-R(a;)| < r for all x E O. Lemma 6.4 implies that 
R is a-Lipschitz, so that \R{x) — R{y)\ < a\x — y\ for x,y E O. Noting that a < 1 < C{x) 
for all a; G O, we observe that we may apply Lemma 7.1 to conclude that f E Ta- 
Note that if 

9i^) = X] hQk{x) 

k>N 

is a polynomial such that Kk\hk\ < a, then the above argument also shows that f + g E Ta- 
So, there are indeed infinitely many polynomials in 7^. □ 

In particular. Theorem 7.2 shows the existence of measure-preserving mixing transforma- 
tions on the p-adics given by polynomial maps. We can also use this method to compute 
explicit examples of such maps, but it is not particularly enlightening to do so. 

8. Polynomial Bernoulli maps on O 

The construction of the preceding section gives infinite classes of measure-preserving poly- 
nomials with different kinds of measurable dynamics. Among these maps are Markov mixing 
maps. We will now study the class of such polynomials whose associated transition matrix 
has all entries equal, in which case the Markov transformation is in fact Bernoulli. The main 
upshot of this study is a class of explicitly given and relatively simple measure-preserving 
Bernoulli polynomial maps. 

Definition 8.1. We say that a measure-preserving locally scaling map T : C — > (9 is 
isometrically Bernoulli for r e V if it is locally scaling for r e V and all entries of the 
associated transition matrix are equal. 

Let V^O/p^^¥l and define 

Bv 




where /iy is the product probabihty measure, and Ty the left-shift. We may let d'y be the 
quotient metric on V. Then, we may define a metric dy on ni>o ^ 

dy ((ao, ai, 02, . . (^'o, ^'i, h, ■■■)) = k|"^^"'"^^o?'^(am, ^'m) where m = min{i : 7^ bi}. 
We give two justifications for this metric: 

(i) View elements of V as ^-tuples under the isomorphism W^^V corresponding to 
TT-adic expansion (i.e., the isomorphism induced by the map shown in (ii)). Then, 
expanding each element in the product to a £-tuple, dy is just the dictionary metric 
(with base |7r|). 

(ii) For each a eV we may let a e C be a coset representative for the quotient. Then, 
the map 

(ao,ai, . . .) I — > ^aln^' 

gives a bijection ni>o ^ ^ This metric is the unique metric making this map 
an isometry. 

Now, the term isometrically Bernoulli is partially motivated by the following: 

Lemma 8.2. Let T : O ^ O be a transformation and let £ > 1. Then, the following are 
equivalent: 

(i) T is isometrically Bernoulli for r = \n\^; 

(ii) For all x,y & O satisfying \x — y\ < \7i\^, 

\T{x)-T{y)\ = \nr'\x-y\. 

(iii) Let V = O/p^. There exists an invertible isometry $ : (9 — > ni>o ^ such that 
$ o T = Ty o $; that is, {O, T) is metrically isomorphic to By. 

Proof, (i)^(ii): 

Let H = O/Br{0), and A : ]R>o the associated transition matrix. Note that if T is 

isometrically Bernoulli, then each entry of A must be equal, and hence must be equal to 
^ = p{r). Now, T must be locally scaling for r e V, so \T{x) — T{y)\ — C{x)\x — y\ ior 

\x — y\ < r — |7r|^. But, we must have p{l/C{x)) — p{r), so C{x) — 1/r = |7r|~^. 
(ii)^(iii): 

Let V — O/p^. Now, (ii) implies that T is locally scaling for r. Letting A be the associated 
transition matrix, we readily note that all non-zero entries of A must be equal to p(|7r|^); as 
A is a stochastic matrix, this implies that all entries of A are non-zero. 

Now, let S' = {Xa, IJ'A,w,Ta) be as in Theorem 5.6. We see that By = S'. We observed 
above that all entries of A are non-zero; then, A is irreducible and Theorem 5.6 gives us 
a topological and measurable isomorphism $ : C — > X^- Note that the balls of X^ with 
respect to dy are just the cylinder sets. Moreover, one may check that for each m > 0, 
Xa is a disjoint union of balls of radius r = \7^\"^, which must then each have measure 
q~"^ = p{r). Then, Proposition 5.5 implies that takes balls of a given radius to balls of 
the same radius; moreover, must take each of the distinct balls of radius IttI™ in Xa 
to a distinct ball of radius IttI™" in O. So each ball of radius Ivrl™" in O must be the pre-image 
of precisely one ball of the same radius in Xa- It follows that $ and are both isometrics. 
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(iii)^(i): 

Note that for x, y G (9 we have 

\T{x)-T{y)\ = dv($(T(x)),$(T(y))) = dv{Tv{^{x)),Tv{^{y))). 

Then, for dv{^{x), $(y)) = \x — y\ < |7r|^ we compute 

dv{Tv{^{x)),Tv{^{y))) = |7r|-W($(x),$(y)) = \'k\-'\x - y\. □ 

Remcirk 8.3. Note that item (ii) of Lemma 8.2 imphes that T is |7r|~^-Lipschitz. 

Now, we may combine Lemma 6.4 with Lemma 8.2 to get: 

Corollciry 8.4. Let {uk} he a T.B.R.B.O. sequence with values in O, with corresponding 
interpolating polynomials Pk,Qk- Say T : O ^ O is given by the expansion, in the sense of 
Theorem 6.1, 

T{x) = "^auQuix), with at G O, \ak\ 0. 

fe>0 

Assume that 

(i) M = maXk>o K,k\ak\ exists, where is as in Theorem 6.2; 

(ii) There is a unique ku > attaining this maximum, and moreover it is of the form 

ku = q^ for some i > 1; 

(iii) \akj,j\ = 1 (hence, M = Kkj^J. 

Then, T is isometrically Bernoulli for r = 1/M e V. 

Proof. As /cm is the unique value attaining the maximum, the strong triangle inequality and 
Lemma 6.4 imply that 

\T{x)-T{y)\ = 

for \x — y\< 1/nkM — ^/M. Then, our claim follows by Lemma 8.2. □ 

Example 8.5. Let X = Qp and C = Zp. Then, {0, 1,2,.. .} is a T.B.R.B.O. sequence, and 
letting Pfc, Qk be the corresponding interpolating polynomials we can check that Qk{x) = (^) 
(cf. Example 6.3). In this case, Kk = pl-iogp^J, Therefore, we can rewrite the sufficient 
conditions in Corollary 8.4 as follows. Given T : Zp — > Zp defined by 

T(., = g„.(^) 

assume that 

(i) M = m£iXk>o\ak\p^^"^p''^ exists; 

(ii) There is a unique /cm > attaining this maximum, and moreover it is of the form 
kM — P^ for some i > 1; 

(iii) lajt^l = 1 (thus, M = p^). 

Then T is isometrically Bernoulli for r = p~^. In particular, the polynomials (p^) for i > 
clearly satisfy these conditions, and so each defines a Bernoulli transformation on Zp. 

Note, in particular, that this criterion applies to any map Zp Zp defined by m(^) +F{x) 

with M e Z^ and F G Zp[x]. An example of such a map is that given by from [WS98]. 
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^ak [Qk{x) - Qk{y)] 



k>0 



\X 



y\ — M\x — y\ 



In this context, the polynomials and ^y-^ are in a sense the most natural isometrically 
Bernoulli maps: 

Example 8.6. Take a set of coset representatives for Zp/pZp. Then, using Example 6.3 we 
may form a T.B.R.B.O. sequence, and then the p*'^ corresponding interpolating polynomial 
(and unit multiples of it) will be Bernoulli by Corollary 8.4. 

Let's look at the two most common sets of coset representatives for the quotient Zp/pZpi 

(i) Take as coset representatives 0, 1, 2, ... ,p — 1. The resulting T.B.R.B.O. sequence 
is {0, 1, . . .}. Then, Pp{x) = x{x - 1) ■ ■ ■ {x - p + 1) and Qp{x) = Q is the p^^ 
corresponding interpolating polynomial. 

(ii) Take as coset representatives and the (p — 1)*^* roots of unity (there are exactly 
p — 1 by Hensel's Lemma); these are called the "Teichmiiller representatives." Then, 
Pp{x) — x^ — X and 



Qp{x) 



Pp{x) _ 1 



X^ — X 



Pp(p) pp-^-1 p 

So, the polynomials and (up to unit) are analogs, arising by the same construction 
from the two most common choices for the coset representatives of Zp/pZp. 

Example 8.7. Let K = Fg((t)) and O = ¥q[[t]]. We may construct a T.B.R.B.O. sequence 
as in Example 6.3, having 0, 1, 2, . . . , g — 1, i as its first q + 1 terms. Then 

_ x{x-l)...{x-q + l) 
t{t-l){t-2)...{t-q + iy 

which is isometrically BernouUi by Corollary 8.4. However, t — 1, t — 2,...,t — q + 1 are all 
units in O, thus 

/ .V / / V x(x — 1) . . . (x — q + 1) 
t{t-l)...{t-q + l)Q,{x) = ^ ^—^ ^ ^ 

defines a Bernoulli transformation as well. 

Now, we will give two examples of isometrically Bernoulli polynomial maps on the rings of 
integers of finite extensions of Qp. First, we briefly motivate our choice of examples. For K 
a finite extension of Qp, let n — [K : Qp],/ — [O/p :¥p\, and e = log|^| \p\. It is a standard 
result that ef = n. It is evident that the nature of how O compares to Zp depends on the 
values of e and /. The two extreme cases are / = 1, e = n (in which case we say that the 
extension is totally ramified) and e = 1, / = n (in which case we say that the extension is 
unramified) . We give an example from each of these two extremes. For more background on 
the relevant theory, including the "standard" results invoked in this paragraph and in the 
following two examples see [Ser62], particularly Ch. I §7, 8., Ch. Ill §5, Ch. IV §4. 

Example 8.8. Take p > 2 and let 

K — Qp(Cp) where Cp is a primitive p*^ root of unity. 

It is a standard result that 1 — Cp t^^Y be taken as a uniformizing parameter and that 
the extension is totally ramified and so the set {0, . . . ,p — 1} gives a complete set of coset 
representatives for O/p. We may construct a T.B.R.B.O. sequence as in Example 6.3. The 
first p terms would be just 0, ... ,p — 1, with the next term 1 — (p. The first p^ terms would 
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be {i + j(l — (p)} for < i,j < p, with the next term (1 — (p)"^. Noting that q ~ p and 
applying Corollary 8.4 shows that the transformations defined by the polynomials 

x{x — 1) . . . {x — p + 1) 

and 



n {x-i-j{i-Q) 



^P^^ 0<i,j<p 

1 2 

are isometrically Bernoulli for r = |7r|^ = and r = |7rp = IpI^*"^, respectively. 

Example 8.9. Take / > 1 and let 

K = Qp(C) where C is a primitive (p-^ — 1)*'^ root of unity. 

It is a standard result that K is the unique unramified extension of Qp of degree /. So, p may 
be taken as a uniformizing parameter. Let ( e O/p be the image of ( under the quotient 
map. We note that ( must generate the residue field extension, i.e., O/p = Fp(C) = ¥p[(]. 
So, S — {oo + oiC + . . . + af-iC^~^}, with < oq, oi, . . . , a/-i < p, is a complete set of 
coset representatives for O/p. Applying the construction of Example 6.3 we may construct 
a T.B.R.B.O. sequence whose first q = p^ terms are precisely the elements of S, with the 
following term being p. Then, applying Corollary 8.4 shows that the transformation defined 
by the polynomial 

- TT {x-ao-aiC a/_iC^"^) 

0<ao,ai,...,ay_i<p 

is isometrically Bernoulli for r = \p\. 

9. Bernoulli maps on Z 

Define, as usual, 

Z = limZ/nZ = JJZp. 

n,| p 

We briefiy note that the results of this section allow us to produce examples of maps N — > Z 
which extend to Bernoulli maps on Zp for each p, and hence to a Bernoulli map on Z. More 
explicitly, we obtain the following Proposition: 

Proposition 9.1. Let Qq, Oi, . . . be a sequence of integers satisfying the following conditions 
for each rational prime: 

(i) \ak\p = 1 for k = p; 
(n) \ak\p< p'LiogpfcJ for k> p. 

Define f : N ^ Z by 



I — n V / 



fc=0 

Then, f extends to an isometrically Bernoulli transformation / : Zp ^ Zp for each prime p. 

Proof. For each prime p, note that the quantity p^^°^p''^\a,k\p attains its maximum for k = p 
(and for no other k) and that \ak\p = 1. Then, the result is immediate by Corollary 8.4. □ 
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Example 9.2. Let / : N ^ Z be defined by 

■ n 



p 



p<n p'<p 

where the summation is over primes bounded by n, and the product over primes bounded 
by p. In the notation of the Proposition, we have 

p'<p 

and ak = for k not a prime. So, for each prime p it is the case that \ap\p = 1. Moreover, 
for A; > p we see that \ak\p < p^^^Liugp^^J < p-[^ogpk\^ g^^ ^j^^ conditions of the Proposition 
are satisfied, and / extends to an isometrically BernouUi transformation Zp — > Zp for each 
prime p. 

Example 9.3. Let / : N ^ Z be defined by 



f{n) = ^{k-l)\ 

k<n ^ 

That is, we set ak = {k — 1)!'^. Then, |ap|p = \{p — = 1- And, ioi k > p we see that 

certainly \ak\p = \{k — < p~l-'°SpfcJ_ go^ the conditions of the Proposition are again 

satisfied. 

10. Polynomial almost Bernoulli maps on Zp 

An important condition shared by isometrically Bernoulli polynomials maps is that their 
derivatives must have constant valuation. We may use this observation to come up with 
a class of interesting non-examples. Our non-examples will be polynomial maps whose 
derivatives have constant valuation, but that are not measure-preserving. 

Proposition 10.1. Given n e N, the map j •.'L^^'L^ defined by 



n 



satisfies \f'{x)\ — C for some C e V and for all x & Zp if and only ifn — ap^ , with 1 < a < p 
and £ e Z>0; and 

- + ... + ^^ -^0 (modp) 

u u + a — 1 

for each it e {1, 2, . . . ,p — a}, where inverses mod. p are taken in ¥*. 

Proof. Write n = ap^ + r, a < p, r < p^. We first wish to show that r = 0. We may compute 

— r 



!/'(«) I = 



a 



/ \n\ for < a < n — 1. 



If r > 1, then (1 + ty = (1 -|- P^^^^l + ty ^ (mod p) has no ^ term, so we see that 
I /'(p - 1)1 < |/'(0)|. Thus, r = 0. Note that \f'{x)\ is constant on Zp iff it is constant 
on Z, since Z C Zp is dense. For a; G Z, wc observe that the set {x, x — 1, . . . ,x — n + 1} 
contains precisely n/p*^ terms divisible by p^ for k < £, and either zero or one term divisible 
by some higher power of p. By the strong triangle inequality, we note that < p^, 
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with equality if there is a term divisible by ^/^"^ . If no term is divisible by p^"*"^, then let 
itp^, . . . , (w+a— be the terms divisible by and observe that ti, . . . , u+a—\ 

are coprime to p. An easy computation shows that = "p^ if and only if 

a-l 

^(M + i)-^^0 (modp). □ 



i=0 



Remark 10.2. Suppose the hypotheses of the Proposition hold. Then, we may compute 
the value of ][x) (mod p) by performing a careful but easy computation involving cancelling 
corresponding powers in x{x — 1) • • • (a; — n + 1) (henceforth, "the numerator") and n\. The 
only terms which are not obviously matched are those correspoding to terms divisible by p^ . 
Suppose up^ ^ . . . , (m + a — are the terms in the numerator divisible by p^, where now we 
need not assume that xi, . . . , + a — 1 are coprime to p. Then, 

/(a;) ^ nLol!i±^ (modp). 

The simplest family of maps satisfying the hypotheses of Prop. 10.1 is that in the following 
Corollary: 

Corollary 10.3. Let p > 3, then n—{p — 2)p^ satisfies the conditions of Prop. 10.1 and so 

|/'(a;)| = p^ for all a; G Z 



Proof. We may verify the last condition of Prop. 10.1 by an easy computation. □ 

Example 10.4. Suppose p > 3, £ > 1, and n — {p — 2)p^. Define a transformation 
/ : Zp ^ Zp by 



n 



By the Corollary, we have that |/'(.t)| = p for all x G Zp. Mimicking the proof of Lemma 6.4 
we can show that / is locally scaling for r = Moreover, since (p — 1)! = —1 (mod p), 

the computation of the previous Remark yields 

{0 if x = 0, 1, . . . , n — 1 (mod p^"*"^) 
1 if X = n, n + 1, . . . , n + p^ — 1 (mod p^+^) 
— 1 if X = n + p^, . . . ,p^'^^ — 1 (mod p^'^^) 

Identifying i G {0, 1, . . . ,p^~^^ — 1} with i +p^"'"^Zp, the associated transition matrix for / is 
thus 



if i = 0, 1, . . . , n — 1 (mod p^^^) and j = (mod p) 
(A(i, j))o<ij<p<;+i = <J i if i = n,n + l,...,n + p^-l (mod p^+^) and j = 1 (mod p) 

- if i = n + p^, . . . , p^"*"^ — 1 (mod p^~^^) and j = —1 (mod p) 



p 

L 

P 
1 

P 

So, if p 7^ 3, we see that / is not measure-preserving. However, A does have a left- 
eigenvector of eigenvalue 1: 



, if i = (mod p) 
w = (w(i))o<i<p€+i = < / 



p if i = ±1 (mod p) 
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This corresponds to an /-invariant measure /i on Zp defined on any /^-measurable set 5" C Zp 

by 

ms) = ^^11 (s n BJO)) + -ii(sn BJi)) + -fi(Sn bJ-i)) . 
p p p 



Then, the map $ defined in Proposition 5.5 gives a measurable isomorphism of {Zp, n, f) 
with {Xa, IJ'A,w,Ta), where the latter dynamical system is mixing Markov. 
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